skip to main content


Search for: All records

Creators/Authors contains: "Liu, Sanzhen"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    We employed several algorithms with high efficacy to analyze the public transcriptomic data, aiming to identify key transcription factors (TFs) that regulate regeneration inArabidopsis thaliana. Initially, we utilized CollaborativeNet, also known as TF-Cluster, to construct a collaborative network of all TFs, which was subsequently decomposed into many subnetworks using the Triple-Link and Compound Spring Embedder (CoSE) algorithms. Functional analysis of these subnetworks led to the identification of nine subnetworks closely associated with regeneration. We further applied principal component analysis and gene ontology (GO) enrichment analysis to reduce the subnetworks from nine to three, namely subnetworks 1, 12, and 17. Searching for TF-binding sites in the promoters of the co-expressed and co-regulated (CCGs) genes of all TFs in these three subnetworks and Triple-Gene Mutual Interaction analysis of TFs in these three subnetworks with the CCGs involved in regeneration enabled us to rank the TFs in each subnetwork. Finally, six potential candidate TFs—WOX9A, LEC2, PGA37, WIP5, PEI1, and AIL1 from subnetwork 1—were identified, and their roles in somatic embryogenesis (GO:0010262) and regeneration (GO:0031099) were discussed, so were the TFs in Subnetwork 12 and 17 associated with regeneration. The TFs identified were also assessed using the CIS-BP database and Expression Atlas. Our analyses suggest some novel TFs that may have regulatory roles in regeneration and embryogenesis and provide valuable data and insights into the regulatory mechanisms related to regeneration. The tools and the procedures used here are instrumental for analyzing high-throughput transcriptomic data and advancing our understanding of the regulation of various biological processes of interest.

     
    more » « less
    Free, publicly-accessible full text available November 22, 2024
  2. Free, publicly-accessible full text available May 1, 2024
  3. Abstract

    Four statistical selection methods for inferring transcription factor (TF)–target gene (TG) pairs were developed by coupling mean squared error (MSE) or Huber loss function, with elastic net (ENET) or least absolute shrinkage and selection operator (Lasso) penalty. Two methods were also developed for inferring pathway gene regulatory networks (GRNs) by combining Huber or MSE loss function with a network (Net)-based penalty. To solve these regressions, we ameliorated an accelerated proximal gradient descent (APGD) algorithm to optimize parameter selection processes, resulting in an equally effective but much faster algorithm than the commonly used convex optimization solver. The synthetic data generated in a general setting was used to test four TF–TG identification methods, ENET-based methods performed better than Lasso-based methods. Synthetic data generated from two network settings was used to test Huber-Net and MSE-Net, which outperformed all other methods. The TF–TG identification methods were also tested with SND1 and gl3 overexpression transcriptomic data, Huber-ENET and MSE-ENET outperformed all other methods when genome-wide predictions were performed. The TF–TG identification methods fill the gap of lacking a method for genome-wide TG prediction of a TF, and potential for validating ChIP/DAP-seq results, while the two Net-based methods are instrumental for predicting pathway GRNs.

     
    more » « less
  4. Abstract

    Goss's wilt, caused by the Gram-positive actinobacterium Clavibacter nebraskensis, is an important bacterial disease of maize. The molecular and genetic mechanisms of resistance to the bacterium, or, in general, Gram-positive bacteria causing plant diseases, remain poorly understood. Here, we examined the genetic basis of Goss's wilt through differential gene expression, standard genome-wide association mapping (GWAS), extreme phenotype (XP) GWAS using highly resistant (R) and highly susceptible (S) lines, and quantitative trait locus (QTL) mapping using 3 bi-parental populations, identifying 11 disease association loci. Three loci were validated using near-isogenic lines or recombinant inbred lines. Our analysis indicates that Goss's wilt resistance is highly complex and major resistance genes are not commonly present. RNA sequencing of samples separately pooled from R and S lines with or without bacterial inoculation was performed, enabling identification of common and differential gene responses in R and S lines. Based on expression, in both R and S lines, the photosynthesis pathway was silenced upon infection, while stress-responsive pathways and phytohormone pathways, namely, abscisic acid, auxin, ethylene, jasmonate, and gibberellin, were markedly activated. In addition, 65 genes showed differential responses (up- or down-regulated) to infection in R and S lines. Combining genetic mapping and transcriptional data, individual candidate genes conferring Goss's wilt resistance were identified. Collectively, aspects of the genetic architecture of Goss's wilt resistance were revealed, providing foundational data for mechanistic studies.

     
    more » « less
  5. Abstract

    Understanding gene regulatory networks is essential to elucidate developmental processes and environmental responses. Here, we studied regulation of a maize (Zea mays) transcription factor gene using designer transcription activator-like effectors (dTALes), which are synthetic Type III TALes of the bacterial genus Xanthomonas and serve as inducers of disease susceptibility gene transcription in host cells. The maize pathogen Xanthomonas vasicola pv. vasculorum was used to introduce 2 independent dTALes into maize cells to induced expression of the gene glossy3 (gl3), which encodes a MYB transcription factor involved in biosynthesis of cuticular wax. RNA-seq analysis of leaf samples identified, in addition to gl3, 146 genes altered in expression by the 2 dTALes. Nine of the 10 genes known to be involved in cuticular wax biosynthesis were upregulated by at least 1 of the 2 dTALes. A gene previously unknown to be associated with gl3, Zm00001d017418, which encodes aldehyde dehydrogenase, was also expressed in a dTALe-dependent manner. A chemically induced mutant and a CRISPR-Cas9 mutant of Zm00001d017418 both exhibited glossy leaf phenotypes, indicating that Zm00001d017418 is involved in biosynthesis of cuticular waxes. Bacterial protein delivery of dTALes proved to be a straightforward and practical approach for the analysis and discovery of pathway-specific genes in maize.

     
    more » « less
  6. Summary

    The fungal pathogen,Magnaporthe oryzae Triticumpathotype, causing wheat blast disease was first identified in South America and recently spread across continents to South Asia and Africa. Here, we studied the genetic relationship among isolates found on the three continents.

    Magnaporthe oryzaestrains closely related to a South American field isolate B71 were found to have caused the wheat blast outbreaks in South Asia and Africa. Genomic variation among isolates from the three continents was examined using an improved B71 reference genome and whole‐genome sequences.

    We found strong evidence to support that the outbreaks in Bangladesh and Zambia were caused by the introductions of genetically separated isolates, although they were all close to B71 and, therefore, collectively referred to as the B71 branch. In addition, B71 branch strains carried at least one supernumerary mini‐chromosome. Genome assembly of a Zambian strain revealed that its mini‐chromosome was similar to the B71 mini‐chromosome but with a high level of structural variation.

    Our findings show that while core genomes of the multiple introductions are highly similar, the mini‐chromosomes have undergone marked diversification. The maintenance of the mini‐chromosome and rapid genomic changes suggest the mini‐chromosomes may serve important virulence or niche adaptation roles under diverse environmental conditions.

     
    more » « less
  7. null (Ed.)
    Drought stress is a major constraint in global maize production, causing almost 30–90% of the yield loss depending upon growth stage and the degree and duration of the stress. Here, we report that ectopic expression of Arabidopsis glutaredoxin S17 (AtGRXS17) in field grown maize conferred tolerance to drought stress during the reproductive stage, which is the most drought sensitive stage for seed set and, consequently, grain yield. AtGRXS17-expressing maize lines displayed higher seed set in the field, resulting in 2-fold and 1.5-fold increase in yield in comparison to the non-transgenic plants when challenged with drought stress at the tasseling and silking/pollination stages, respectively. AtGRXS17-expressing lines showed higher relative water content, higher chlorophyll content, and less hydrogen peroxide accumulation than wild-type (WT) control plants under drought conditions. AtGRXS17-expressing lines also exhibited at least 2-fold more pollen germination than WT plants under drought stress. Compared to the transgenic maize, WT controls accumulated higher amount of proline, indicating that WT plants were more stressed over the same period. The results present a robust and simple strategy for meeting rising yield demands in maize under water limiting conditions. 
    more » « less
  8. Abstract Objectives

    This report provides information about the public release of the 2018–2019 Maize G X E project of the Genomes to Fields (G2F) Initiative datasets. G2F is an umbrella initiative that evaluates maize hybrids and inbred lines across multiple environments and makes available phenotypic, genotypic, environmental, and metadata information. The initiative understands the necessity to characterize and deploy public sources of genetic diversity to face the challenges for more sustainable agriculture in the context of variable environmental conditions.

    Data description

    Datasets include phenotypic, climatic, and soil measurements, metadata information, and inbred genotypic information for each combination of location and year. Collaborators in the G2F initiative collected data for each location and year; members of the group responsible for coordination and data processing combined all the collected information and removed obvious erroneous data. The collaborators received the data before the DOI release to verify and declare that the data generated in their own locations was accurate. ReadMe and description files are available for each dataset. Previous years of evaluation are already publicly available, with common hybrids present to connect across all locations and years evaluated since this project’s inception.

     
    more » « less
  9. null (Ed.)
    Abstract Genome sequences provide genomic maps with a single-base resolution for exploring genetic contents. Sequencing technologies, particularly long reads, have revolutionized genome assemblies for producing highly continuous genome sequences. However, current long-read sequencing technologies generate inaccurate reads that contain many errors. Some errors are retained in assembled sequences, which are typically not completely corrected by using either long reads or more accurate short reads. The issue commonly exists, but few tools are dedicated for computing error rates or determining error locations. In this study, we developed a novel approach, referred to as k-mer abundance difference (KAD), to compare the inferred copy number of each k-mer indicated by short reads and the observed copy number in the assembly. Simple KAD metrics enable to classify k-mers into categories that reflect the quality of the assembly. Specifically, the KAD method can be used to identify base errors and estimate the overall error rate. In addition, sequence insertion and deletion as well as sequence redundancy can also be detected. Collectively, KAD is valuable for quality evaluation of genome assemblies and, potentially, provides a diagnostic tool to aid in precise error correction. KAD software has been developed to facilitate public uses. 
    more » « less